Distributed Software RAID Architectures for Parallel I/O in Serverless Clusters*
نویسندگان
چکیده
In a serverless cluster of computers, all local disks can be integrated as a distributed software RAID (ds-RAID) with a single I/O space. This paper presents the architecture and performance of a new RAID-x for building ds-RAID. Through experimentation, we evaluate the RAID-x along with RAID-5, chained-declustering, and RAID-10 architectures, all embedded in a Linux cluster environment. All four ds-RAID architectures aim to scale in aggregate I/O bandwidth. The RAID-x is unique with its orthogonal striping and mirroring (OSM) architecture. The reliability comes from orthogonal mirroring, while the bandwidth is enhanced from distributed striping. To support single I/O space, we have developed cooperative disk drivers (CDD) at the Linux kernel to enable fast remote disk accesses without using a central file server such as the NFS. The performance of the RAID-x is experimentally proven superior in three areas: (1) significant improvement in I/O bandwidth especially in parallel write operations, (2) pipelined mirroring in the background with low overhead, and (3) enhanced scalability and reliability in cluster computing with a single I/O space. These claims are supported by Andrew and Bonnie benchmark results, obtained on the USC cluster of 16 Linux PCs. Reliable cluster middleware and Linux extensions are developed to enable not only single I/O space, but also shared virtual memory and global file hierarchy. Toward this end, we compare the RAID-x with four related parallel or distributed RAID projects: Digital Petal, Berkeley Tertiary Disk, Princeton TickerTAIP, and HP AutoRAID. Their relative strengths, shortcomings, and applications are discussed along with suggested further research.
منابع مشابه
Orthogonal Striping and Mirroring in Distributed RAID for I/O-Centric Cluster Computing
-This paper presents a new distributed disk-array architecture for achieving high I/O performance in scalable cluster computing. In a serverless cluster of computers, all distributed local disks can be integrated as a distributed-software redundant array of independent disks (ds-RAID) with a single I/O space. We report the new RAID-x design and its benchmark performance results. The advantage o...
متن کاملReliable Cluster Computing with a New Checkpointing RAID-x Architecture
In a serverless cluster of PCs or workstations, the cluster must allow remote file accesses or parallel I/O directly performed over disks distributed to all client nodes. We introduce a new distributed disk array, called the RAID-x, for use in serverless clusters. The RAID-x architecture is based on an orthogonal striping and mirroring (OSM) scheme, which exploits full-bandwidth and protects th...
متن کاملRAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing
A new RAID-x (redundant array of inexpensive disks at level x) architecture is presented for distributed I/O processing on a serverless cluster of computers. The RAID-x architecture is based on a new concept of orthogonal striping and mirroring (OSM) across all distributed disks in the cluster. The primary advantages of this OSM approach lie in: (1) a significant improvement in parallel I/O ban...
متن کاملProviding Single I/O Space and Multiple Fault Tolerance in a Distributed RAID
Commodity EIDE disks provide low cost storage but are severely limited in bandwidth and cannot be made fault-tolerant. On the other hand, conventional RAID devices provide reliability and performance but worse price/performance figures. A cluster of PCs can be seen as a collection of networked low cost disks; such a collection can be operated by proper software so as to provide the abstraction ...
متن کاملPerformance Evaluation of Software RAID vs. Hardware RAID for Parallel Virtual File System
Linux clusters of commodity computer systems and interconnects have become the fastest growing choice for building cost-effective high-performance parallel computing systems. The Parallel Virtual File System (PVFS) could potentially fulfill the requirements of large I/O-intensive parallel applications. It provides a high-performance parallel file system by striping file data across multiple clu...
متن کامل